Using Portfolio Theory for Automatically Processing Information about Data Quality in Data Warehouse Environments
نویسندگان
چکیده
Data warehouses are characterized in general by heterogeneous data sources providing information with different levels of quality. In such environments many data quality approaches address the importance of defining the term “data quality” by a set of dimensions and providing according metrics. The benefit is the additional quality information during the analytical processing of the data. In this paper we present a data quality model for data warehouse environments, which is an adaptation of Markowitz’s portfolio theory. This allows the introduction of a new kind of analytical processing using “uncertainty” about data quality as a steering factor in the analysis. We further enhance the model by integrating prognosis data within a conventional data warehouse to provide risk management for new predictions.
منابع مشابه
A Solution to View Management to Build a Data Warehouse
Several techniques exist to select and materialize a proper set of data in a suitable structure that manage the queries submitted to the online analytical processing systems. These techniques are called view management techniques, which consist of three research areas: 1) view selection to materialize, 2) query processing and rewriting using the materialized views, and 3) maintaining materializ...
متن کاملInvestigating the Role of Non-Financial Information Analysis and Risk- Return Analysis along with Financial Information in Increasing the Efficiency of the Stock Portfolio of Banks
The purpose of this study was to investigate the role of non-financial information analysis and risk-return analysis along with financial information in increasing the selected banks and financial institutions of Tehran Stock Exchange portfolio efficiency. To evaluate the efficiency of the portfolio, the Treynor's ratio was used and attempted to determine the Treynor's ratio of the selected opt...
متن کاملImprovement of the Analytical Queries Response Time in Real-Time Data Warehouse using Materialized Views Concatenation
A real-time data warehouse is a collection of recent and hierarchical data that is used for managers’ decision-making by creating online analytical queries. The volume of data collected from data sources and entered into the real-time data warehouse is constantly increasing. Moreover, as the volume of input data to the real time data warehouse increases, the interference between online loading ...
متن کاملCloud Computing Technology Algorithms Capabilities in Managing and Processing Big Data in Business Organizations: MapReduce, Hadoop, Parallel Programming
The objective of this study is to verify the importance of the capabilities of cloud computing services in managing and analyzing big data in business organizations because the rapid development in the use of information technology in general and network technology in particular, has led to the trend of many organizations to make their applications available for use via electronic platforms hos...
متن کاملLand Cover Subpixel Change Detection using Hyperspectral Images Based on Spectral Unmixing and Post-processing
The earth is continually being influenced by some actions such as flood, tornado and human artificial activities. This process causes the changes in land cover type. Thus, for optimal management of the use of resources, it is necessary to be aware of these changes. Today’s remote sensing plays key role in geology and environmental monitoring by its high resolution, wide covering and low cost...
متن کامل